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Abstract 



We study large Wigner random matrices in the case when the marginal distributions of matrix 
entries have heavy tails. We prove that the largest eigenvalues of such matrices have Poisson 
statistics. 

1 Introduction and Formulation of Results. 

The main goal of this paper is to study the largest eigenvalues of Wigner real symmetric and Her- 
mitian random matrices in the case when the matrix entries have heavy tails of distribution. We 
remind that a real symmetric Wigner random matrix is defined as a square symmetric n x n matrix 
with i.i.d. entries up from the diagonal 

^4 = {cLjk), CLjk = akj, 1 < j < k < n, {djk}j<k — i.i-d. random variables. (1) 

A Hermitian Wigner random matrix is defined in a similar way, namely as a square n x n Hermitian 
matrix with i.i.d. entries up from the diagonal 

A- = (ojjfe), cijk = Okj, 1 < j < k < n, {cijk}j<k — i-i.d. complex random variables (2) 

(it is assumed quite often that the distribution of the diagonal matrix entries is different from the 
distribution of the off-diagonal entries). While our main results (with obvious modifications) are 
valid for the Hermitian random matrices with i.i.d. entries as well, we will restrict ourselves for the 
most of the paper to the real symmetric case. 

The ensembles (^Q), (J2J) were introduced by E. Wigner in the fifties ([HH]) [S3)) ISHJ) wno considered 
the case of centered identically distributed (up from the diagonal) matrix entries with the tail of the 
distribution decaying sufficiently fast so that all moments exist. Wigner proved that under the above 
conditions the mathematical expectation of the empirical distribution function of n~ 1 l 2 A converges 
as n — > oo to the semicircle law, i.e. 

-E(#(\<x, l<i<n))^F(x)= [ f(t)dt, (3) 
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where Ai,...,A n are the eigenvalues of n~ 1 / 2 A, the density of the semicircle law is given by 
f(t) = -^jV^cr 2 — x 2 , for t G [— \^2a, \/2<r] and a 2 is the second moment of matrix entries. Wigner's 
result was subsequently strengthened by many mathematicians (see e.g. ^7], pQ, [21], [Hj)- In 
particular, Pastur (jMj) and Girko ( 10J) proved that if AW = (a$), 1 < hi < is an n x n 
random symmetric matrix with independent (not necessarily identicaly distributed) centered entries 
with the same second moment a 2 , then the necessary and sufficient for the convergence of the 
empirical distribution function of the eigenvalues of n~ l ^ 2 A^ to the semicircle law has the Lindeberg- 
Feller form: ^ Yli<i<j<n f\ x \>r^/n x<2 dFij{x) —* 0, where Fij(x) is the distribution function of . 

The simplest case of (^Q) from the analytical viewpoint is when the entries o^, 1 < % < j < n 
are independent Gaussian -/V(0, 1 + 6ij) random variables. Then the ensemble is called Gaussian 
Orthogonal Ensemble (GOE). We refer the reader to |2J, chapter 6 for the discussion of the GOE 
ensemble. In a similar fashion the ensemble of Hermitian random matrices with independent and 
identically distributed (up from the diagonal) matrix entries is called Gaussian Unitary Ensemble 
(GUE, see |21| . chapter 5) amd the ensemble of Hermitian self-dual matrices with quaternion entries 
is called Gaussian Symplectic Ensemble (GSE, see [2^) chapter 7). In the Gaussian ensembles one 
can study in detail the local statistical properties of the eigenvalues both in the bulk and at the 
edge of the spectrum. In the seminal papers (j^-ilj. |32j ) Tracy and Widom proved that after the 
proper rescaling the distribution of the largest eigenvalues in the GOE, GUE and GSE ensembles 
converges to what is now called Tracy- Widom distribution. In particular they proved that for the 
largest eigenvalue of GOE 



lim^ Pr {x max < 2\/n + — ^r) -»• Fx = exp (~^J^ l( x ) + 



x — s)q 2 (x)dx I , (4) 



and for the largest eigenvalue of the GUE ensemble 



Jffin^ Pr (^Xmax < 2\/n + — j^) -> F 2 = exp (- J (x - s)q 2 (x)dx^j , (5) 

where q(x) is the solution of the Painleve II differential equation q"(x) = xq(x) + 2q s (x) determined 
by the asymptotics q(x) ~ Ai(x) at x = +oo. It was also established (see (SJ, |S2], 0) that after 
the rescaling at the edge of the spectrumas the fc-point correlation functions have a limit. In the 
GUE case the limiting fc-point correlation functions has a determinantal form 

Pk(xi,...,x k ) = det(K(xi,Xj)) lr£iJl£k , (6) 

where 

ut s u ( x Ai(x)Ai'(y) - Ai'(x)Ai(y) 

K(x, y) = K Airy (x, y) = (7) 

x — y 

is a so-called Airy kernel. In the GOE case the limiting A;-point correlation functions have a pfaffian 
form and can be written as 

/ \ 1/2 

p k (x 1 ,...,x k ) = \det{K(xi,x j )) 1 ^ i ^ k J , (8) 
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where K(x, y) is a 2 x 2 matrix kernel such that 



1 f y 

K u (x,y) = K 22 {y,x) = K Airy (x,y) + -Ai(x) I Ai(t)dt, (9) 

^ J —oo 

1 

^i2(«,y) = -~Ai(x)Ai(y) - — K Airy (x,y), (10) 



+oo \ y r x 



K2i(x,y) = l I / Ai(v)dv \ Ai{x + u)du — e(x — y) + - / Ai{u)du 



o \J x+u / J y 

+ - / Ai(u)du / Ai(u)cfe, (11) 
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where e(z) = ^sign(z). Similar formulas hold for the GSE. 

Soshnikov (HE]) see also 29 j) proved that the Tracy- Widom law for the largest eigenvalues is 
universal provided the laws of the distribution of matrix entries are symmetric, all moments exist 
and do not grow too fast. To the best of our knowledge there are no results proving universality 
in the bulk of the spectrum for real symmetric Wigner matrices. In the Hermitian case Johansson 
f |13|) proved the universality in the bulk under the condition that matrix entries have a Gaussian 
component. 

In this paper we will study the spectral properties of Wigner real symmetric and Hermitian 
random matrices in the case when the matrix entries have heavy tails of distribution. In other 
words, in addition to the assumption that are i.i.d. real (complex) random variables up from the 
diagonal (1 < j < k < n) we also assume that the probability distribution function satisfies 

G(x)=Pv(\a jk \>x) = ^-, (12) 

where < a < 2 and h(x) is a slowly varying function at infinity in a sense of Karamata ( |14j. |26|). 
in other words h(x) is a positive function for all x > 0, such that limu-^oo -j^y = 1 for all t > 0. The 
condition (|12j) means that the distribution of | \ belongs to the domain of the attraction of a stable 
distribution with the exponent a (see e.g. jllj . Theorem 2.6.1). The case h(x) = const{\ + o(l)) in 
(|T2|) was considered on a physical level of rigor by Cizeau and Bouchaud in [3] , who called such set 
of matrices "Levy matrices" (see also [2] and |12| for some physical results on the unitary invariant 
Levy ensembles). They argued that the typical eigenvalues of A should be of order of n~ . Indeed, 
such normalization makes the euclidean norm of a typical row of A to be (with high probability) 
of the order of constant. Cizeau and Bouchaud suggested a formula for the limiting distribution of 
the (normalized by n«) eigenvalues. The formula for the limiting spectral distribution has a more 
complicated density then in the finite variance case. Namely, they claimed that the density should 
be given by 

f(x) = L C Jf^\x), (13) 



a/2 
C 3 

where is a density of a centered Levy stable distribution defined through its Fourier transform 
L(k): 



= — I dkL(k)e ikx , (14) 



2vr 

InL(k) = -C\k\ a (l + i(3sgn(k)tan(ira/2)), (15) 
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and C(x), (3{x) satisfy a system of integral equations 

C{x) = J^\v\S-*L°®M\x-±)dy, (16) 

/*H-oc i 

0(x) = L C ^ y \x- l -)dy. (17) 

J x y 

Note that the density f{x) in ()13|) is not a Levy density itself, since C(x), /3(x) are functions of 
x. It was also argued in jlj that the density f(x) should decay like at infinity, which suggests 

2 

that the largest eigenvalues of A (in the case (|12|) . /i(x) = const) should be of order n.«, and not 
na as the typical eigenvalues. 

Recently, Soshnikov and Fyodorov ((301) used the method of determinants to study the largest 
eigenvalues of a sample covariance matrix A t A in the case when the entries of a rectangular m x 
n matrix A are i.i.d. Cauchy random variables. The main result of |3U| states that the largest 
eigenvalues of A 1 A are of the order m 2 n 2 and 

lim £ fdet(l + — f-^U)) = lim E Y[(l + zA~i)~ 1/2 = exp ( y^l (18) 

n— »oo \ m z n z J n^oo - 1 - - 1 V 7T / 

i=l v ' 

oo 

= E[](l + ^)- 1/2 , (19) 
i=l 

where Aj, i = 1, . . . , n are the eigenvalues of ■^^ I A t A, z is a complex number with a positive real 
part, the branch of \fz on D = {z : > 0} is such that yl = 1. E denotes the mathematical 
expectation with respect to the random matrix ensemble of sample covariance matrices and E denotes 
the mathematical expectation with respect to the inhomogeneous Poisson random point process on 
the positive half-axis with the intensity 3/2 • The convergence is uniform inside D (i.e. it is unform 
on the compact subsets of D). 

The goal of the paper is to study the distribution of the largest eigenvalues of real symmetric (^Q) 
and Hermitian (J2J) Wigner random matrices in the case of the heavy tails Let us introduce the 
normalization coefficient b n defined so that 



n 2 1 

-G{b n x) - — , (20) 

for all positive x > 0, where G is defined in (|12j) above. This can be achieved by selecting b n to be 
the infimum of all t for which G(t — 0) > \ > G(t + 0). It is clear that 

n^- & <.b n <. n^ +s (21) 

for any 5 > as n — ► 00 and - — > 2 as n — > 00. Let us denote the eigenvalues of b~ l A by 
Ai > A2 > • • • A n . As we will see below this normalization is chosen in such a way that the largest 
normalized eigenvalues are of the order of constant and the vast majority of the eigenvalues go to 
zero. The main result of our paper is formulated in the next two theorems. 

Theorem 1.1 Let A be a Wigner real symmetric 0) or Hermitian (0) random matrix with the heavy 
tails llty) and Ai the largest eigenvalue ofb^A. Then 

lim Pr(Ai < x) = exp(-x~ a ). (22) 
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Remark 1 

It is easy to see (and will be shown later) that the r.h.s. of (|22|) also gives the limiting distribution 
of the maximum of the (properly normalized) matrix entries, i.e. 

lim Pr( max &~ 1 |aj 7 | < x) = exp(— x~ a ). (23) 

Similarly to Theorem 1.1 one can study the distribution of the second, third, fourth, etc largest 
eigenvalue of A. The following general result holds. 

Theorem 1.2 Let A be a Wigner real symmetric 0) or Hermitian HJ) random matrix with the 
heavy tail of the distribution of matrix entries Then the random point configuration composed 

of the positive eigenvalues ofb~ l A converges in distribution on the cylinder sets to the inhomogeneous 
Poisson random point process on (0, +oo) with the intensity p{x) = x ^ a . 

Remark 2 As we will see below the result of the Theorem 1.2 can be reformulated in such a way 
that for any finite integer k > 1 the joint distribution of the k largest eigenvalues of b~ l A coincides 
in the limit n —* oo with the joint distribution of the first k order statistics of the absolute values of 
the normalized matrix entries {b~ \cnj |, 1 < i < j < n}. This reformulation is also indicative of the 
method of proof of the results of Theorems 1.1 and 1.2. Of course a similar result about the negative 
eigenvalues of b~ x A holds true as well. 

We remind the reader that a Poisson random point process on the real line with the locally 
integrable intensity function p(x) is defined in such a way that the counting functions (e.g. numbers 
of particles) for the disjoint intervals I\, . . . , are independent Poison random variables with the 
parameters Jj p(x)dx, j = l,...,k. Equivalently, one can define the Poisson random point by 
requiring that the /c-point correlations functions are given by the products of the one-point correlation 
functions (intensities), i.e. Pk(xi, • • • , x^) = 11^=1 P( x j)- 

Remark 3 The arguments of the proof are quite soft and can be extended without any difficulty 
to banded random matrices and some other classes of Hermitian and real symmetric random matrices 
with independent entries with heavy tails of distribution. 

Remark 4 For random Schrodinger operators the Poisson statistics of the eigenvalues in the 
localization regime was first proved by Molchanov in (see also the paper by Minami [H]). There 
is a vast literature on the Poisson statistics of the energy levels of quantum systems in the case of 
the regular underlying dynamics (see e.g. |3|. [27|. [TH]. [25]. [TK]. [T3]. fX)\). 

The rest of the paper is organized as follows. We prove Theorem 1.1 in section 2. The proof of 
Theorem 1.2 is presented in section 3. 

2 Proof of Theorem 1.1 

We will prove Theorems 1.1 and 1.2 in the real symmetric case. In the Hermitian case the arguments 
are essentially the same. We start by considering the distribution of the largest matrix entry. It 
follows from that 

f „ vn(n+l)/2 

Pr( max &^|a« I < *) = (1 - G(b n x)) nin+1 ^ = 1 - — (l+o(l)) = exp(-x-)(l+o(l)), 

i<«<j<« V n z x a J 

(24) 
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(see e.g. [TS], Theorem 1.5.1). 

Let us order the N = n(n + l)/2 i.i.d. random variables 1 < % < j ' < n, i.e. IflijjJ > 

|ai 2 i 2 l ^ I a *3 j3 1 — • • • — l a w'jvl> where are the indices of the Z-th largest (in absolute value) 

matrix entry. We will use the notation ay' = 6^ 1 |aj i jJ, I = 1, . . . ,N for the normalized l-th largest 
absolute value of matrix entries. 

In this paper only the statistical properties of a finite number of the largest order statistics will be 
of importance. In particular, for any finite k the inequalities between the first k order statistics are 
strict with probability going to 1 as n —* oo. We start with a standard proposition that generalizes 
(El- 

Proposition 1 Let < c\ < d\ < C2 < d,2 < ■ ■ ■ < c4 < +oo, and 1[ = (q,d{), I = 1, ... k, be 

disjoint intervals on the positive half-line. Then the counting functions #(//) = #(1 < i < j < n : 
6~ 1 1 a^j | G Ii), I = 1, . . . , k, are independent in the limit n — ► oo and have Poisson distribution with 
the parameters \i\ = f T p(x)dx, where p(x) = — n^. 

J 1 1 X 

In other words, for any e > the restriction of the point configuration {fr^ 1 !^!} to the interval 
[e, +oo) converges in distribution on the cylinder sets to the inhomogeneous Poisson random point 
process with the intensity p{x) = (we refer the reader to sections 2 and 3 for the definition 
and elementary properties of Poisson random point processes). In particular, 

k— 1 __i a 

Pr(a (fc) <x) ^exp(-x- Q )^^-. (25) 

1=1 

The proof of the proposition is a straightforward generalization of the calculations in (|24|) (see 
e.g. [IH] Theorem 2.3.1). 

Now we proceed to the proof of Theorem 1.1. The result of the theorem follows from 1)24(1 if we 
can show that -jjj — > 1 in probability as n — > oo, where as before Ai is the largest eigenvalue of b~ l A 
and aW = b^ 1 max.x< : i<j< n \aij\. 

We start with an elementary lemma. 

Lemma 1 a) With probability going to 1 there are no diagonal entries greater (in absolute value) 
than b}^^ 20 . 

b) With probability going to 1 there is no pair (i,j), 1 < i < j < n, such that \aij\ > ft^ 9 ^ 100 and 
I i I i i ^ 1,1/10 

c) For any positive constant 5 > with probability going to 1 there is no row 1 < i < n that has 
at least two entries greater in absolute value than b£ 



For the convinience of the reader we sketch the proof of the lemma. We start with part a) . We 
first remind the reader that by a classical result by Karamata (|T3|, see also |26) for a nice exposition 
of the subject) a slowly varying function at infinity can be respresented on the interval [B,+oo), 
where B is sufficiently large, as 



h(x) = exp ^(x) + J ~Y~ dt ^j 



(26) 
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where rj(x) is a bounded measurable function which has a limit at infinity and e(x) is a continuous 
function on [B, +00) such that lim^^oo e(x) = 0. Using (|12|). (|21|) and (|26|) we obtain 



Pr (max M < = (l - G(b^)) n = (l 




> exp(-n-3o)(l + o(l)). (27) 

In a similar fashion, in order to prove the statement b) we have to show that n ^ n ~ 1 ^ G(b n 9 ^ 100 ) x 

G{b n ,W ) < n" 1 / 20 ^ + 0(1)), which again follows from (JUJ), (JHJ) and (ESJ). The proof of part c) is 
very similar and, therefore, left to the reader. 

Let be the indices of the maximal (in absolute value) matrix element. It follows from (|24|) 

and part a) of Lemma 1 that with probability going to 1 one has that (a^jj > b n 9 and i\ 7^ j\. 
Let /1 be a unit vector in W 1 such that all its coordinates except the ii-th and ji-th are zero, the 
i\-ih coordinate is 1 and the ji-th coordinate is +1 if a>i 1 j 1 is nonnegative and —1 otherwise. Then 
one can easily calculate tha value of the quadratic form (Afi, fi) = \ai 1 j 1 \ + + ^aj 1 j 1 , and it 

follows from (|24j) and Lemma 1 b) that with probability going to 1 the sum of the last two terms is 
much smaller than the first term, and, in particular, (Afi, fi) = 1 0^1(1 + o(b n ))• Since f± is a 
unit vector, we clearly have Ai > (Afi, f\). To show that |aj Ul |(1 + o(l)) is also an upper bound on 
the largest eigenvalue we need one more lemma. 

Lemma 2 With probability going to 1 a Wigner matrix (0), does not have a row such that both 
the maximum of the absolute values of the matrix elements in that row and the sum of the absolute 

1_ _|_ a_ 

values of the remaining elements in the row are at least bn 4 • In other words, 

3_ _|_ a __ 

Pi{3i, 1 < i < n : max \au\ > bn 8 , I ^ \au\ I — max \aa\ > ^^ 8 i —* (28) 
i<i<« \tZrri, I l<j<n 

as n — > oo. 




We will treat separately the two cases 1 < a < 2 and < a < 1. 

Let us start with the first case, i.e. 1 < a < 2. We choose e = 2 m+\ > where M is some integer, so 
that e < I — f • We claim that there exists sufficiently small positive constant 7 (7 = 4j J +2 would 
suffice) that for any row 1 < i < n and for all integers < k < M 

E(#(l<j<n: \oijl > 6^)) = nG(b k n e ), (29) 
Pr (# (l < j < n : \a l3 \ > b^j > 2nG{b k ^)) < exp(-n^) (30) 

for sufficiently large n. Indeed, ()29j) immediately follows from the definition of G in (|12|) . while 
(|3U|) follows from the Chernoff's inequality Pr(X > EX + t) < exp(— + g?^p ) for a binomial 
random variable X ~ Bin(n,p) and ()12() . 1)21(1 . It then follows from l|3U|) that with probability 
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1 — 0(nexp(— n 7 )) we have that for all rows 1 < i < n of the matrix A 

M 

E M <Y.*{ l ^^ n '- M ^ b n) b n +1)e ^ 

j:\ ai j\<bk M+im2M+1) k=0 
M M 1 a 1 1 3 a 

E^ +1)£ 2nG(6^) = E^n +1)e 2n/ i (6^)6; afa < nbV f < 6l +f < -W +t . (31) 

k=0 k=0 

for sufficiently large n. 

Finally, we observe that for any fixed row i 

Pr( E M > ^! +f ) < exp(-n K ), (32) 

Af+l 3, a 

i:6™<|a lJ |<65 + ^ 

for sufficiently small positive k > 0. The proof of (|32|) is similar to the arguments given above and 
will be left to the reader. 

Since by part c) of Lemma 1 with probability going to 1 each row has at most one entry greater 

3 _|_ a_ 

in absolute value than bn 16 the bounds (|3*Tj) and (|3*2*1 imply the statement of the Lemma 1.2. in the 
case 1 < a < 2. 

In the case < a < 1 the onsideration is similar. We again choose e = 2 m+i > where M is some 
integer in such a way that e < § and 7 = 4A j" +2 so that with probability 1 — 0(nexp(— n 7 )) we have 
that for all rows 1 < i < n of the matrix A 

M 



M+l k=0 

i:|a^|<b™ 
M M 



E b^ +1 ^2nG(b k n e ) = J2 b n +1)t ^h( y b k n t )b- akt < b n 2nb n T < b\+ l < ife| + * . (33) 

fc=0 k=0 

for sufficiently large n. Combined with ()32|) this finishes the proof. Lemma 2 is proven. 
Remark 5 It follows from l|24|) and Lemma 2 that 

ll-Alloo J= m ax > I ai,- 1 = max la^-lfl + o(l)) (34) 

l<j<n 

with probability going to 1. The Theorem 1.1 now follows from the fact that the matrix norm ||^4||oo 
is an upper bound for the largest eigenvalue of the matrix A. 

3 Proof of Theorem 1.2 

Let as in section 2 a"' = 6^ 1 |ai iJz |, I = 1, . . . , N = n(n+l)/2 be the values 6~ 1 |aij|, l<i<j<n 
put in the decreasing order. In particular, = maxfj b~ l \aij\. Similarly to section 2 we construct 
the unit vectors /; S R n , I = 1,2,..., such that all coordinates of f\ except the fy-th and ji- 
th are zero, the ij-th coordinate is 1 and the jz-th coordinate is +1 if a,i l j l is nonnegative and 
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-1 otherwise. Then Afi = |a iUi |/; + ^= J2 m &,j( a krn + aj im )e m , where e m are the standard 
basic vectors in W 1 , i.e. all coordinates of e m are zero, except for the m-th one which is 1. Since 

/ \l/2 

( Em^i j a iim ) — Sm^jj l a i(m|) it follows from Proposition 1 and Lemma 2 that for any finite 

number of the largest values a^ l \ 1 < I < k one has that b^Afi = a® fi + r\, where the euclidian 
norms of r\ are o(l). Therefore by an elementary argument in perturbation theory for Hermitian 
operators we obtain that b~ x A has eigenvalues a^(l + o(l)), 1 < Z < A;, for any finite k. The 
last thing we need to show is that (with probability going to 1) these eigenvalues are exactly the k 
largest eigenvalues of b~ l A. In other words, so far we proved that with probability going to 1 one 
has A; > a^(l + o(l)), I = 1, . . . , k. Our goal now is to prove the reverse inequalities A; < a® x 
(l + o(l)), I = 1, . . . , k. To simplify the exposition we will prove the result for the second eigenvalue. 
The reasoning in the general case is very similar. Let, as before, be the indices of the largest in 

absolute value matrix element. Consider an (n — 1) x (n — 1) submatrix obtained from A by deleting 
the ii-th row and the ii-th column. Let us denote the submatrix by B. It follows from Lemma 1 
that %2 7^ ii-, ]2 7^ ji with probability going to 1. By the same reasoning as in Theorem 1 we have 
that the largest eigenvalue of b~ l B is a^ 2 \l + o(l)), but by interlacing property of the eigenvalues of 
A and B we have that the largest eigenvalue of B is not smaller than the second largest eigenvalue 
of A. Theorem 1.2 is proven. 
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